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Abstract — The nonlinear Fourier transform (NFT), a powerful 
tool in soliton theory and exactly solvable models, is a method for 
solving integrable partial differential equations governing wave 
propagation in certain nonlinear media. The NFT decorrelates 
signal degrees-of-freedom in such models, in much the same way 
that the Fourier transform does for linear systems. In this paper, 
this observation is exploited for data transmission over integrable 
channels such as optical fibers, where pulse propagation is 
governed by the nonlinear Schrodinger equation. In this trans- 
mission scheme, which can be viewed as a nonlinear analogue 
of orthogonal frequency division multiplexing commonly used in 
linear channels, information is encoded in the spectral amplitudes 
associated with nonlinear frequencies. Unlike most other fiber- 
optic transmission schemes, this technique deals with both 
dispersion and nonlinearity directly and unconditionally without 
the need for dispersion or nonlinearity compensation methods. 
This paper explains the mathematical tools that underlie the 
method. 

Index Terms — Fiber-optic communications, nonlinear Fourier 
transform, integrable channels, Lax pairs, Zakharov-Shabat 
spectral problem. 

I. Introduction 

AN EVOLUTION equation is a partial differential equa- 
tion (PDE) for an unknown function q(t, z) of the form 

q z = K(q), (1) 

where K(q) is an expression involving q and its derivatives 
with respect to t. In (Q~|i and throughout this paper, subscripts 
are used to denote partial derivatives with respect to the 
corresponding variable. 

The evolution equation (Q3 is a PDE in 1 + 1 dimensions, 
i.e., one variable t e M represents a temporal dimension and 
one variable z ^ represents a spatial dimension. In most 
mathematical literature, the roles of z and t are interchanged, 
so rather than a spatial evolution as in (Q]), mathematicians 
study temporal evolution. 

An important example of an evolution equation, and the 
one that motivates this work, is the stochastic nonlinear 
Schrodinger (NLS) equation governing the complex envelope 
of a narrowband signal in an optical fiber in the presence of 
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noise, given by (T), El 

«M4«M = « M | Q(T|i|| . 

+V{t,£), Q^e^<£.(2) 

Here £ denotes distance (in km) along the fiber; the transmitter 
is located at I = 0, and the receiver is located at I = C. The 
symbol r represents retarded time, measured in seconds, i.e., 
t = t — /3i£ where t is ordinary time and f3i is a constant, and 
Q(t,£) is the complex envelope of the signal propagating in 
the fiber. The coefficient fa, measured in s 2 /km, is called 
the chromatic dispersion coefficient, while 7, measured in 
W -1 km -1 , is the nonlinearity parameter. Finally, V(t, £) 
is bandlimited white Gaussian noise with in-band spectral 
density <7q (W/(km • Hz)) and autocorrelation 

E {V(t,£)V*(t',£')} = <t 2 5 w (t - r')S(l - £'), 

where Sw(x) = 2Wsmc(2Wx). It is assumed that Q(t,£) is 
bandlimited to W for all I, I ^ C. 

The stochastic NLS equation (f2]i models both chromatic 
dispersion (captured by the jP2d 2 Q/dr 2 term), which is 
responsible for temporal broadening, and the Kerr nonlinearity 
(captured by the 7|Q| 2 Q term), which is responsible for 
spectral broadening. Pulse propagation is governed by the 
tension between these effects and can be linearly dominated, 
nonlinearly dominated, or solitonic (in which case the two 
effects are balanced). The NLS equation defines a nonlinear 
dispersive waveform channel from Q(t, 0) at the transmitter 
to Q(t, C) at the receiver. 

Such a nonlinear dispersive waveform channel is a major 
departure from the classical additive white Gaussian noise 
and wireless fading channels in terms of analytical difficulty. 
Here the signal degrees-of-freedom couple together via the 
nonlinearity and dispersion in a complicated manner, making 
it difficult to establish the channel input-output map, even 
deterministically. Most current approaches assume a linearly- 
dominated regime of operation, or consider the nonlinearity 
as a small perturbation, and are geared towards managing 
and suppressing the (detrimental) effects of the nonlinear 
and dispersive terms. Inline dispersion management, digital 
backpropagation, and other forms of electronic pre-and post- 
compensation belong to this class of methods (see l3ll- lfT0l 
and references therein). 

In this paper we adopt a different philosophy. Rather 
than treating nonlinearity and dispersion as nuisances, we 
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seek a transmission scheme that is fundamentally compatible 
with these effects. We effectively "diagonalize" the nonlinear 
Schrodinger channel with the help of the nonlinear Fourier 
transform (NFT), a powerful tool for solving integrable non- 
linear dispersive partial differential equations (PDEs) ifTTl . 
\\2\. The NFT uncovers linear structure hidden in the one- 
dimensional cubic nonlinear Schrodinger equation, and can be 
viewed as a generalization of the (ordinary) Fourier transform 
to certain nonlinear systems. 

With the help of the nonlinear Fourier transform, we are able 
to represent a signal by its discrete and continuous nonlinear 
spectra. While the signal propagates along the fiber based on 
the complicated NLS equation, the action of the channel on 
its spectral components is given by simple independent linear 
equations. Just as the (ordinary) Fourier transform converts a 
linear convolutional channel y(t) = x(t) * h(t) into a number 
of parallel scalar channels, the nonlinear Fourier transform 
converts a nonlinear dispersive channel described by a Lax 
convolution (see Sec. Pil l into a number of parallel scalar 
channels. This suggests that information can be encoded (in 
analogy with orthogonal frequency-division multiplexing) in 
the nonlinear spectra. 

The nonlinear Fourier transform is intertwined with the 
existence of soliton solutions to the NLS equation. Solitons 
are pulses that retain their shape (or return periodically to 
their initial shape) during propagation, and can be viewed 
as system eigenfunctions, similar to the complex exponentials 
e JW *, which are eigenfunctions of linear systems. An arbitrary 
waveform can be viewed as a combination of solitons, associ- 
ated with the discrete nonlinear spectrum, and a non-solitonic 
(radiation) component, associated with the continuous nonlin- 
ear spectrum. 

The goal of this first article is to introduce the mathematical 
tools that underlie this approach to information transmission. 
These tools are sufficiently general to encompass not only 
the nonlinear Schrodinger equation, but also other completely 
integrable nonlinear dispersive PDEs. Thus, the transmission 
scheme described here can also be applied to any channel 
model in this general class. These tools are also described 
in mathematics and physics (see, e.g., ifTTl - lTPIl ); here we 
attempt to extract those aspects of the theory that are relevant 
to the engineering aspects of the information transmission 
problem. In subsequent articles, we will describe useful nu- 
merical methods and give performance results for actual fiber- 
optic systems. 

We will find it convenient to work with the nonlinear 
Schrodinger equation (f2]i in a normalized form. By changing 
variables 

G 1 * T ™ 



with Tn 



\P2\c 



to = V ' 2 an< ^ P = 7Z' we 8 et ^ e normalized NLS 
equation 

2, 



jq z (t, z) = q tt + 2\q(t, z)\ z q(t, z) + v(t, z), (4) 



where 



Throughout this paper, © will be our primary and motivating 
illustrative example. 

II. A Brief History of the Nonlinear Fourier 
Transform 

The nonlinear Fourier transform (also known as the in- 
verse scattering transform or 1ST) is a method for solv- 
ing certain nonlinear dispersive partial differential equations 
(PDEs). These are integrable PDEs, i.e., nonlinear differential 
equations exhibiting certain hidden linearity. There are sev- 
eral integrable equations having physical significance, among 
which is the NLS equation. The 1ST method was a result of 
extensive efforts in theoretical physics and applied mathemat- 
ics in 1960s, closely associated with the notion of solitons 
and is often used to predict their existence and properties in 
integrable models IfTTl . fl2l . 

In the 1950s, in one of the first dynamical-systems simula- 
tions performed on a computer 1141 . Fermi, Pasta and Ulam 
performed a numerical experiment to understand why solids 
have finite heat conductivity. They modeled the solid as a 
lattice with point masses at the lattice points coupled with 
springs each having a quadratic nonlinearity. To their surprise, 
rather than observing an equipartition of energy among all 
modes, energy cycled periodically among a few low-order 
modes. Such behavior implies that the nonlinear oscillator 
behaves somehow linearly. 

In the 1960s, Zabusky and Kruskal showed that equation 
of motion for the Fermi-Pasta-Ulam lattice in the continuum 
limit is a remarkable PDE called the Korteweg-de Vries (KdV) 
equation 1151 . known in the study of water waves. The KdV 
equation for the evolution of a real-valued pulse q(t, z) as a 
function of time t and distance z is 

q z = qqt + qm- (5) 

Zabusky and Kruskal found that © has pulse-like solutions 
whose shape is preserved (or varies periodically) during prop- 
agation. Furthermore, they made the surprising observation 
that when two such pulses are launched towards each other, 
despite their nonlinear interaction, they pass through each 
other without changing their shape. Zabusky and Kruskal 
coined the term soliton for such solutions, in recognition of 
their particle-like properties lfT5l . 

The spectacular properties of these solutions greatly ex- 
cited the mathematics and physics communities and many 
researchers started to study solitons. In a celebrated paper 
ifTBI . Gardner, Greene, Kruskal and Miura uncovered some 
of the deep structure underlying the KdV equation which 
is responsible for solitons and their unusual properties. The 
authors of [ [161 were studying the celebrated linear Schrodinger 
equation from quantum mechanics, given by 



i/j z (t, z) = ip u (t, z) + g(t, z)tp(t, z)/3, 



(6) 



where ip(t, z) is the wave-function and g(t, z) is an external 
potential. They found that if one takes the solution q{t, z) of 
the KdV equation (0 as the external potential in ©, then the 
eigenvalues of the Schrodinger operator 



E {v(t, z)v*(t', z')} = ^5 w {t - t')S(z - z'). 



H 



dt 2 



+ g(t,z)/3 



(7) 
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remain invariant during the evolution in z\ Based on this 
critical observation, they developed a method to recover the 
external potential g(t, z) = q(t, z) by solving an inverse 
problem for ©. The method analytically predicts soli ton 
solutions for the KdV equation, as observed earlier by Zabusky 
and Kruskal through numerical computations. They had in fact 
found the 1ST for the special case of the KdV equation. 

It was not immediately clear if the method developed in 
lfl6l could be generalized to other nonlinear PDEs, since it 
is not obvious if there exists a certain auxiliary operator, like 
the Schrodinger operator H, whose eigenvalues are preserved 
during the evolution. In a landmark paper published in 1968 
ifTTl . Lax put the theory on a firm mathematical footing. 
In particular, he established the mathematical relationship 
between the auxiliary operators with invariant eigenvalues 
(now called Lax pairs) and the original nonlinear equation. 
Once a Lax pair for a nonlinear PDE is found, a method along 
the lines of |fl6l can be applied to solve that PDE. 

Shortly afterwards, in 1972, Zakharov and Shabat found a 
Lax pair for the NLS equation in one spatial dimension (T), 
and thus established that this equation, too, could be solved in 
the same manner. Details of this method for the NLS equation 
were subsequently developed by Ablowitz and others (see 
IfTTl and references therein), who also referred to this scheme 
as the "nonlinear Fourier transform". After these discoveries 
from the 1960s and 70s, research into soli tons became an 
established area of research, lying at the intersection of applied 
mathematics and nonlinear physics. 

Nonlinear PDEs solvable by the NFT are called integrable 
equations or exactly solvable models |fT8l . These are usually 
Hamiltonian systems having an infinite number of conserved 
quantities, and include the KdV, NLS, modified KdV, and 
sine-Gordon equations, the Toda lattice, among others IfTTl . 
Ifl2l . These equations all exhibit similar properties, including 
the existence of soliton solutions. Signal processing problems 
(e.g., detection and estimation) involving soliton signals in the 
Toda lattice have been considered by Singer lfl9l . 

III. Canonical Lax Form for Exactly Solvable 
Models 

A. Lax Pairs and Evolution Equations 

We wish to consider linear differential operators whose 
eigenvalues are invariant during an evolution IfTTl . More 
precisely, we consider such operators defined in terms of a 
signal q(t, z) where the eigenvalues of the operator remain 
constant even as q evolves (in z) according to some evolution 
equation. 

To facilitate the discussion, it is useful to imagine a linear 
operator represented as a matrix; however, we must keep in 
mind that, when moving from finite-dimensional spaces to 
infinite-dimensional spaces (of e.g., functions and operators), 
some results do not carry over necessarily. The relevant prop- 
erties of linear operators needed for this paper are reviewed 
in Appendix lAl 

Let L(z) be a square matrix whose entries are functions 
of z. Clearly, the eigenvalues of this matrix are in general 
functions of z too. However, for some matrices, it might be 



the case that while the entries of the matrix change with z, the 
eigenvalues remain constant (independent of z). Such a matrix, 
if diagonalizable, should be similar to a constant diagonal 
matrix A, i.e., L(z) = G(z)AG~ 1 (z), for some similarity 
transformation G(z). 

This idea generalizes to operators. Let T~L be a Hilbert space, 
let V be some domain that is dense in H, and let L(z) : 
T> — » H be a family of bounded linear operators indexed by a 
parameter z |20l . If the eigenvalues of L(z) do not depend on 
z, then we refer to L(z) as an isospectral family of operators. 
If diagonalizable, it follows that for each z, L(z) is similar 
to a multiplication operator A (the operator equivalent of a 
diagonal matrix; see AppendixlAl. i.e., L(z) = G(z)AG~ 1 (z), 
for some operator G(z). 

Assuming that L(z) varies smoothly with z, we can consider 
the rate of the change (with respect to z) of L(z). We have 



dL(z) 
dz 



G'AcT 1 + GA (-G^G'G -1 ) 

G'G' 1 (GAG -1 ) - (GAG -1 ) G'G -1 
M{z)L{z) - L{z)M{z) = [M, L] , (8) 



where G = dG{z)/dz, M = G'G -1 , and [M,L] = ML - 
LM is the commutator bracket. In other words, every diag- 
onalizable isospectral operator L(z) satisfies the differential 
equation ([8j. 

Conversely, suppose M(z) is given and the (unknown) 
diagonalizable operator L(z) evolves according to $Q with 
initial condition L(0) = GoAoGq . Let G(z) be the (unique 
invertible) solution to G = MG with G(0) = G . One can 
easily verify that L(z) = G(z)AqG(z)~ 1 satisfies (8). Since 
the solution to a first-order differential equation is unique lETl . 
we see that L(z) is an isospectral family. 

The characterization of isospectral operators is therefore 
summarized in the following lemma IfTTl . 

Lemma 1. Let L(z) be a diagonalizable operator. Then L(z) 
is isospectral if and only if it satisfies 



dz 



(9) 



for some operator M. If L is self-adjoint (so that L is unitarily 
equivalent to a multiplication operator, i.e., L = GAG* ), then 
M must be skew-He nnitian, i.e., M* = —M. 

Proof: The proof was outlined above. The skew- 
Hermitian property of M can be shown by differentiating 
GG* =1. M 
It is important to note that L and M do not have to be 
independent and can depend on a common parameter, e.g., 
a function q(t, z), as illustrated in Fig. [JJ The isospectral 
property of the solution is unchanged. The commutator bracket 
[M, L] in (O can create nonlinear evolution equations for 
q(t, z) in the form 

dq 

dz 



K(q), 



where K(q) is some, in general nonlinear, function of q(t, z) 
and its time derivatives. An example of this is the KdV 
equation. 
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q(t,0) 



L{q(t,0)) 



q(t,z) 



L(q(t,z)) 



q(t,£) 



L(q,(t,C)) 



Constant Spectrum 



Fig. 1. An isospectral flow: the spectrum of L is held invariant even as 
q(t, z) evolves. 



Example 1 (KdV equation). Let D = denote the time- 
derivative operator, and let q(t, z) be a real-valued function. 
Finally, let 

1 



L = D + -q, and let 
M = 4 ( D 3 + -Dq + -qD 



(10) 



The Lax equation L z = [M, L] is easily simplified to 

t^Iz - g (qttt + qqt) + (some terms)!? = 0, (11) 

where is the zero operator. The zero-order term of this 
equation as a polynomial in D, which must be zero, produces 
the KdV equation q z = q ttt + qq t . Using (D k )* = (-l) k D 
(provable by integration by parts), it is easy to see that L 
and M are self-adjoint and skew-Hermitian, respectively, since 
they are real valued and involve even and odd powers of V, 
respectively. The eigenvalues of L are thus preserved from 
Lemma[T] Note that the L operator in this example is precisely 
the (linear) Schrodinger operator H given in (|7]). □ 

Definition 1. A pair of operators L and M, depending on 
z, are called a Lax pair (L, M) if they satisfy (O. Following 
Lemma [T] the eigenvalues of the L operator are independent 
of z. 



B. The Zero-Curvature Condition 

The eigenvalues of the operator L, which are constant in an 
isospectral flow, are defined via 



Lv = Xv. 



(12) 



Taking the z derivative of (TT2b and using the Lax equation 
L z = [M, L], we obtain (L - XI) (v z - Mv) = 0. Since L - 
XI vanishes only on eigenvectors of L, it must be that v z — Mv 
is proportional to an eigenvector, i.e., v z — Mv = av, a e 
C. The choice of a does not influence the results of future 
sections; thus for simplicity we set a = 0. It follows that an 
eigenfunction v(t, z) evolves based on the linear equation 

v z = Mv. (13) 

Furthermore, it is often more convenient to re-write (fT2l as 



for some operator P. The relationship between P and L can 
be derived (if necessary) by combining (DI — P) v = with 
(L — XI) v = 0, obtaining 



P = S(L - XI) + DI, 



(15) 



as in 



where £ is some invertible operator, and D = 
Example Q] 

Combining equations ( TT3l ) and (TPfl) by using the equality 
of mixed derivatives, i.e., vt z = v z t, the Lax equation (O is 
reduced to the zero-curvature condition HI 21 



P z - M t + [P, M) = 0. 



(16) 



Note that the nonlinear equation derived from ( fT6l ) results as a 
compatibility condition between the two linear equations ( fT3l ) 
and ( TPfl i. This shows that certain nonlinear equations possess 
a "hidden linearity" in the form of dT3l > and ( fT4b . 

Following the work of Zakharov and Shabat on the NLS 
equation |1|, Ablowitz et al. l22l suggested that for many 
equations of practical significance, the operator P can be fixed 
as 

-jX r(t, zY 
s(t, z) jX 



P 



(17) 



where r(t,z) and s(t, z) are functions — depending on 
q(t, z) — to be determined to produce a given nonlinear evolu- 
tion equation. From (T5[ this corresponds to the L operator 



L 



J 



D 

s(t,z) 



-r(t,z) 
-D 



with £ = diag(j, —j). In this case, both L and P operate on 
2x1 vector functions. 

Equation (fl4l . with P as in (fTTb . is known as the AKNS 
system (after the authors of ll22l ) and is central in the study of 
the nonlinear Fourier transform fl2| . The important special 



case where r(t, z) = q(t, z) and s(t, z) 



-q*(t,z) 



generally known as the Zakharov -Shabat system. We will 
refer to lfl2l l. (fl3] l and dUJi as the L-, M-, and P-equations, 
respectively. Throughout this paper, we will assume that the 
domain of L, P, and M are subsets of the Hilbert space L 2 (M.), 
denoted here by TL, depending on the particular structure of 
that operator. 

Example 2 (sine-Gordon equations). Let r(t, z) = —s(t, z) = 
\q t {t, z) in ([T7]i and let 



M 



j ( cos(q) -sin(g) 



4A \ - sin(g) - cos(q) 



(18) 



Then the zero-curvature equation is simplified to q tz = sin(g). 
Taking r = s = \qt and M as 

M = J_ ( cosh(q) - sinh(g)\ 
4A \sinh(g) - cosh(g) J ' 

gives q tz = sinh(g). 

Example 3 (Nonlinear Schrodinger equation). Take r 
s = — q* and 



(19) 
□ 

= q, 



v t = Pv, 



(14) 



M 



2jX 2 -j\q(t,z)\< 



-2Xq{t,z) - jq t (t,z) 



2Xq*(t,z)-jqt(t,z) -2jX 2 + j\q(t,z)\' 
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x(t) 



= q(t, 0) 


L z (q) = [M(q),L(q)] 


V(t) = q( 


> 


> 



Fig. 2. A system defined by Lax convolution. 



The zero-curvature equation is simplified to jq z (t,z) = 
q t t(t,z) + 2\q(t,z)\ 2 q(t,z). □ 

Example 4 (KdV equation revisited). Let r = -^=g, s = 
and 



M 



4jA - — + ^ - ^jA^ + ^= + ^ 



V3 



12V3 



-4jA 3 + ^ 



2* 
6 



(20) 

The zero-curvature equation leads to g z = qq t + qttt- d 

It should be noted from Examples Q] and [4] that the choice 
of L and M giving rise to a given nonlinear equation is not 
unique. Obviously one can scale L, or add a constant to L 
or M. In addition, both the Lax equation (0 and dTZb are 
unchanged under orthogonal transformations, i.e., replacing L 
and M with ELS T and £A/£ T , respectively, where E is 
a (constant) orthogonal matrix, i.e., E T E = /. Note further 
that it may be possible to choose two Lax pairs (Li,Mi) 
and (L2, M2) for a given equation such that L\ is self-adjoint 
and L2 is not self-adjoint. The eigenvalues of L\ and L2 are, 
respectively, real and complex; see Appendix [A] 

C. Lax Convolution and Integrable Communication Channels 

Linear systems traditionally have been described by linear 
constant coefficient differential equations. An example is the 
one-dimensional heat equation q z = c 2 qu, where c is the 
diffusion coefficient and q(t, z) represents the heat profile 
across a rod extending in space t, as time z goes on. From 
a systems point of view, this defines a linear time-invariant 
system from input x(t) = q(t,0) at z = to the output 
y(t) = q(t, C) at some z = C. The role of z is therefore 
just a parameter and once fixed (to z = C), the system 
is described by an impulse response (or a Green function) 
h(t; C), representing the underlying (linear) convolution. 

Following this analogy, we wish to define a system in terms 
of a Lax pair (L, M). Here, L and M are parametrized by a 
waveform q(t, z). Such a system accepts a waveform x(t) = 
q(t, 0) at its input and produces a waveform y(t) = q(t, C) 
at its output, according to the evolution equation induced by 
L z = [M,L]. The time-domain input-output map is thus 
given by an evolution equation of the form q z = K(q), 
obtainable from the Lax equation (0 (or its equivalent (fT6]l). 
We refer to such a system as an integrable system. Note that 
an integrable system is completely characterized by the two 
operators (L, M) and the parameter z = C, independent of the 
signals. We denote such a system using the triple (L, M; C). 

Definition 2 (Lax Convolution). We refer to the action of an 
integrable system S = (L, M; C) on the input q(t, 0) as the 
Lax convolution of q with S. We write the system output as 
q(t, C) = q(t, 0) * (L, M; C). See Fig. 



Definition 3 (Integrable communication channels). A wave- 
form communication channel C : x(t) x v(t, z) —* y{t) 
with inputs x(t) 6 i 1 (R) and space-time noise v(t, z) e 
i 2 (R, R+), and output y(t) e L 1 (R), is said to be integrable 
if the noise-free channel is an integrable system. 

Note that noise can be introduced into an integrable system 
in a variety of ways. In what follows, we assume that the 
signal-to-noise ratio is not too small, so that the stochastic 
system may justifiably be considered as a perturbation of 
the deterministic system. Furthermore, in this paper we limit 
ourselves to integrable channels with additive noise, i.e., 

q z = K(q) + v(t,z), q(t,0) = x(t), q(t,C)= y(t) , 

where v(t, z) is distributed band-limited noise. Note that this 
model is not in general equivalent to one in which noise is 
added (in lumped fashion) at the channel output. Here, the 
noise is distributed in space, and so interacts with the signal 
(in a potentially very complicated manner) according to the 
given evolution equation. 

In this paper, we develop a scheme for communication over 
integrable channels. By various choices of Lax pair (L,M), 
one can construct a variety of interesting channel models, 
mostly nonlinear, which go beyond the linear channel models 
typically studied in data communications. Interestingly, some 
linear channels can also be analyzed using this nonlinear 
spectral approach advocated here |23l . The central application 
(and motivation) for this work is fiber-optic communication, in 
which the channel model is given by the nonlinear Schrodinger 
equation (J2J, and for which a Lax pair was given in Example [3] 
In what follows, we first discuss the deterministic (noise-free) 
case and later treat noise as a perturbation of the deterministic 
system. 

IV. Nonlinear Fourier transform 

In this section, we assume that a function q(t, •) e i 1 (R) 
is given, and we define its nonlinear Fourier transform with 
respect to a given Lax operator L. As the notation q(t, •) 
implies, in this and the next section, the variable z can take on 
any value in the range [0, C] and is irrelevant in the forward 
and inverse transforms. We shall therefore omit the index z in 
what follows. 

As previously noted, for concreteness we carry through the 
development of the NFT for the case of the NLS equation ®, 
for which the P- and M-equations are given by 

-jX q(t, z) 

-q*(t,z) j A 

v z = M(X,q)v 

_( 2 ] \ 2 -j\q(t,z)\ 2 -2\q{t,z)-jq t (t,z) 



v t = P{X,q)v 



v, 



(21) 



\2Xq*(t,z)-jq*(t,z) -2jX 2 + j\q(t, z)\ 2 

respectively. Here the P-equation is the Zakharov-Shabat 
system. 

The nonlinear Fourier transform (and its inverse) are defined 
solely in terms of the temporal P-equation OTT i. The spatial 
7l/-equation (l22l is used in a brief, and critical, propagation 
step, but plays no role in the definition of the nonlinear Fourier 
transform. 
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If for a given A e C, the operator L — XI, here equivalent 
to DI — P representing d2TT i. is not invertible, we say that 
A belongs to the spectrum of L and v(t, A) represents its 
associated eigenfunction. In finite-dimensional Hilbert spaces 
of matrices the spectrum is a discrete finite set, called eigenval- 
ues. This may no longer be true in infinite-dimensional spaces 
of operators, where the eigenvalues (if they exist) may only 
be one part of the spectrum. See Appendix lAl 

The nonlinear Fourier transform of a signal x(t) with 
respect to an operator L in a Lax pair is defined via the spectral 
analysis of the L operator, which we consider next. 

A. Canonical Eigenvectors and Spectral Coefficients 

We wish to study solutions of (BTT i. in which vectors v(t) 
are considered as elements of the vector space W. We begin 
by equipping the vector space H with a symplectic bilinear 
form H. x H *-* C, which, for any fixed value of t e R, is 
defined as 

( V (t) , W(t)) s = Vl (t)w 2 (t) - V 2 (t)W! (t) . (23) 

Let us also define the adjoint of any vector v in H as 

«?(*) ' 



m ~ V -«?(*) 

The following properties hold true for all v and w in H: 

m V = — V, 

• (v,v} 3 = 0; 

• (v,w) s = -(w,v) s ; 

• (v,v) s = -(v,v) s = M 2 + \v2 1 2 ; 

• Tt( v ' w \ = ( l, t, w ) s +<f,w t > s ; 

• for every 2x2 matrix A, (Av, w) s + (v, Aw) s = 
tr(A)(v,w) s . 

There are generally infinitely many solutions v of ( l2TT i 
for a given A e C, parametrized by the set of all possible 
boundary conditions. These solutions form a subspace E\ 
of continuously differentiable 2x1 vector functions (an 
eigenspace). 

Lemma 2. For all vectors v(t) and w(t) in E\, 

1) 6e E x *, i.e., v t = P(\*,q)v; 

2) (v(t),w(t)y is a constant, independent of t; 

3) If(v(t),w(t)') s ¥= 0, then v and w are linearly indepen- 
dent and form a basis for E\; 

4) dim(Ex) = 2. 

Proof: Property [TJ follows directly from (f2Tb . To see |2), 
note that ^ t (v,w) s = (v t ,w) s + (v,w t ) s = (Pv,w) s + 
(v, Pw) s = tr(P)(v, w) s = 0. To see |3}, fix t and let 
u(t) e E\, then u(t) = a(t)v(t) + b(t)w(t) for some 
a(t) and b(t). Taking the symplectic inner product of both 
sides with w and v, we get a(t) = (u, w)J(v, w) s and 
b(t) = (u,v)J(w,v) s . From Property (u,w) s , (v,w) s , 
(u,v) s , and (w,v) s are all independent of t. It follows that 
a and b are also independent of t. Finally, |4]i follows from 

ED. □ 

An important conclusion of Lemma [2] is that any two 
linearly independent solutions u and w of d2"TT i provide a 
basis for the solution space. To choose two such solutions, 



t = — oo -^t 



t = oo 



C 2 (t,A*) 



Fig. 3. Boundary conditions for the canonical eigenvectors. 



we examine the behavior of the equation at large values of \t\. 
If we assume that q(t, •) — » as |f| — » oo, then, as |t| oo 
(f2TT> is reduced to 



-jA 



for large \t\, 



jA, 
which has a general solution 

«(t,A) -» [ae- jXt ,/3e :iXt ] T , a,/3eC. 

Two possible boundary conditions, bounded in the upper 
half complex plane (5(A) > 0), are 



v 2 (t,X) 



t —> +00 
e-^K t- 



x . 



(24) 
(25) 



We solve (f2Tb for a given A under the boundary conditions 
(l24l) -(l25l). and denote the resulting solutions for all t e R as 
A) and v 2 (t, A). We can also solve d2Tb for A* under the 
adjoint boundary conditions 



« 2 (t,A*)- 



(26) 



-1 



-co, 



giving rise to two solutions A*) and v 2 (t, A*). From 
Lemma|2]we have that f) 1 (i, A* ) and i> 2 (t,\*) are elements 
of i?A- These four eigenfunctions v 1 ^, A), v 2 (t, A), A*), 
and i' 2 (t, A*), all of them elements of E\, are called canonical 
eigenvectors. Fig. [3] illustrates the canonical eigenvectors at 
their boundaries. 

Lemma 3. Canonical eigenvectors satisfy: 

1) (v^VW&X). = (v 2 (t,X*),v 2 (t,X)) s = 1; 

2) {u 1 ^, A),?; 1 ^, A*)} and {v 2 (t, X),v 2 (t, A*)} are /nc/e- 
pendent sets in E\. 

Proof: 1) Since (u 1 , v 1 ) s is independent of t, using (l24l i and 
©, (S 1 ^, A*), u 1 ^, A)> s = <« 1 (+*, A*), ^(+00, A)> s = 1. 
2) Follows from 1) and Lemma [2] □ 

Choosing A*) and A) as a basis of E\, one can 
project w 2 (t, A), v 2 (t, A*) e E'a on this basis to obtain 

v 2 (t, A) = a(A)S 1 (i, A*) + 6(AK(*> A), (27) 
w 2 (<,A*) = 6*(A*)w 1 (t,A*)-a*(A*)v 1 (t,A), (28) 

where a(A) = (w 2 ,?; 1 )^ and 6(A) = (v 1 ,v 2 ) s . A crucial 
property, following from Lemma |2] is that a(A) and 6(A) are 
time-independent. The time-independent complex scalars a(A) 
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s 



(30) 



and 6(A) are called the nonlinear Fourier coefficients 1111 . 

ma. 

Since the nonlinear Fourier coefficients are time indepen- 
dent, to facilitate computing them, for simplicity we can 
send t to, e.g., +co where w 1 (+cx),A) and w 1 (+cX),A*) are 
known. The other two canonical eigenvectors v 2 and v 2 are 
then propagated from their boundary values v 2 (— oo, A) and 
v 2 {— co, A*) at t = —oo according to vf = P(X,q)v 2 and 
v 2 = P(X*,q)v 2 to obtain n 2 (oo,A) and u 2 (oo,A*). At this 
stage, we have available all four canonical eigenvectors at one 
time, namely, 

{i> 2 (+co, A), w 2 (+co, A), u x (+oo, X),v 1 (+ao, A)} . 

We can now project v 2 (+co, A) and £> 2 (+oo, A) onto the basis 
u 1 (+oo, A) and v 1 (+co, A) according to d27t-d28l to obtain 

[v 2 (+oo,A),v 2 (+oo,A)] = [« 1 (+ao,A),« 1 (+oo,A)] S, (29) 

where 

(a{\) 6* (A*) 
\b{\) -a*(A*), 

The matrix S is called the scattering matrix and contains 
the nonlinear Fourier coefficients ifTTl . Ifl2l . It is a function 
of q(t, •) and says how the solution to (f2Tb is scattered from 
t = —oo to t = +00. More precisely, the field v 2 {— oo, A) = 
[l,0] T e - -' is applied at t = — oo, where q is absent. This 
field evolves forward in time according to (l2Tl i. interacts with 
the signal (which can be viewed as an "obstacle") at finite 
values of t, and subsequently propagates towards t = +oo, 
where again q is absent. The field at t = +oo is measured and 
gives information about the "obstacle" as seen from a distance. 
Although not obvious from the development so far, we shall 
see in Section [VTI1 that the information measured at t = +oo, 
captured by a(A) and 6(A), is complete, in the sense that from 
this information we can retrieve q(t, •) entirely. In view of this 
interpretation, the nonlinear Fourier transform was historically 
referred to as the inverse scattering transform. 

B. The Nonlinear Fourier Transform 

The projection equations d27l i and ( |28] | that give a(A) and 
6(A) are well-defined if A e 1. From Lemma [2] Property 
1), we observe that the eigenspace is symmetric in A, i.e., 
if A is an eigenvalue then so is A*. Thus it is sufficient to 
consider the upper half complex plane C + = {A|3(A) > 0}. 
In this region, the boundary conditions on the basis vectors 
v and v 1 at t = oo decay and blow up, respectively. As a 
result, d27b is consistent in C + only if a(A) = 0. Eigenvalues 
in C + are therefore identified as the zeros of the complex 
function a(A) and they form the discrete (point) spectrum of 
the signal. We will see in the next section that the discrete 
spectrum corresponds to soliton pulses. 

Lemma 4. If q(t) e Li(R), a(A) is an analytic function of X 
on C+. 

Proof: From Lemma in Section [VTll if q(t) e Li(R) the 
scaled canonical eigenvectors u 1 (t, A)e~ jAt and v 2 (t, X)e : > xt 
are analytic functions of A in C + . Since 



is a combination of two analytic functions in C + , it is analytic 
in the same region. (Note, however, that 6(A), which is a 
combination of functions analytic in disjoint regions in C, may 
not be analytic in either of those regions.) □ 
A consequence of Lemma |4] is that the zeros of a(A) in 
C + are isolated points l24l . It follows that the Zakharov- 
Shabat operator for the NLS equation has two types of spectra. 
The discrete (or point) spectrum, which occurs in C + , is 
characterized by those Xj e C + satisfying 

a(Xj) = 0, j =1,2,..., N. 

The discrete spectrum corresponds to solitons, and in this case 
reduces to 



v 2 (t,X J ) = b(X 3 )v 1 (t,X J ). 

The continuous spectrum, which in general includes the whole 
real line 5(A) = 0, corresponds to the non-solitonic (or 
radiation) component of the signal. The continuous spectrum 
is the component of the NFT which corresponds to the 
ordinary Fourier transform, whereas the discrete spectrum 
has no analogue in linear systems theory. The reader is 
referred to Appendix [A] for a number of examples illustrating 
various notions of the spectrum associated with bounded linear 
operators; Example [9] is particularly relevant to the discussion 
here. 

To distinguish between the discrete and continuous spectra, 
we find it convenient to refer to discrete spectral values of A 
using the symbol Xj (with a subscript). Continuous spectral 
values are denoted as A (without a subscript). In general, Aj e 
C+, whereas A e R. 

For the purpose of developing the inverse transform, we find 
it sufficient to work with the ratios 



?(A) 



6(A) 
a(X) 



?(Ai) 



fr(Aj) 



da(A) 



where a'(Xj) denotes the derivative 

We can now formally define the nonlinear Fourier transform 
of a signal with respect to a given Lax operator L as follows. 

Definition 4 (Nonlinear Fourier transform ifTTIl . Ifl3l ). Let q(t) 

be a sufficiently smooth function in Xi(R). The nonlinear 
Fourier transform of q(t) with respect to a given Lax operator 
L consists of the continuous and discrete spectral functions 

q(X) : R >— C and q(X^) : C+ >-» C where 



9(A) 



I and q(Xj) 
6(A) 



a(AV 



& (Aj) 
a'(Xj) 



j = l,2,...,iV, 



in which Xj are the zeros of a(A). Here, the spectral coeffi- 
cients a(A) and 6(A) are given by 



o(A) 
6(A) 



lim v 2 e jXt , 

t^oo 

lim v 2 2 e- jX \ 



(31) 
(32) 



a(X)=(v 2 ,v 1 ) s 



(v 2 e? Xt ,v l e-* Xt ) s , 



where v 2 (t, A) is a solution of d2Tl ) under the boundary 
condition (f25t . □ 

To obtain the continuous spectral function q(X), A e R, 
it is not necessary to obtain a(A) and 6(A) separately. For 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY 



convenience, one can instead write an explicit differential 
equation 

+ g(t)e 2 ^V(t, A) + q*{t)e-^ = 0, 



df 



y(-oo,A)=0, (33) 



and obtain q(X) = limt_ >00 y(t, A). 

Analogously, one can solve the second-order differential 
equation 



d 2 z(t,X) 
dt 2 



2jA + 



q t \ dz(t, A) 



q J dt 



+ \q\ 2 z(t,X)=0, 



*(-ao,A) = l, *±^*> =0,(34) 



and obtain a(A) = lim z(t, A). The discrete spectrum is 

t— >oo 

obtained as the zeros of a(A). 

Just hke the ordinary Fourier transform, the nonlinear 
Fourier transform can be computed analytically only in a few 
cases. An example is given in the next subsection. 



C. Example: Nonlinear Fourier Transform of a Rectangular 
Pulse 



Consider the rectangular pulse 



A, te[t 1 ,t 2 \; 
0, otherwise. 



Let T = t 2 -t x and T = t 2 + h. 

In this case P(X,q) is time-independent when t e [ii,^], 
and (f2TT > under the boundary condition dZSt can be easily 
solved in closed form. The canonical eigenvector v 2 is given 
by 



v 2 (t, A) = exp[(t-ti)P]v 2 (t 1 , X), v 2 (h, A) = Q | 

It follows that 

v 2 (co, A) = v 2 (t 2 , A) = cxp(PT)v 2 (t u X), 

where 



cxp(PT) = exp 



J A q 

-<f iA 



r 



cos(AT) -jAsin(AT) 



A - sin(AT) 



^ sin(AT) cos(AT) + j A sin(AT) J ' 



with A = y'A 2 + |A| 2 . The spectral coefficients are obtained 
from (ED and 03 as 

o(A) = (cos(AT) - jA sin (AT)) e^ T , 



6(A) 



-A* 



sin(AT)e 



The zeros of a(A) in C + , which satisfy 



jtan(TVl^4| 2 + A 2 ) 



1 + 



JAP 
A 2 



give rise to the discrete spectrum. The continuous spectrum is 
given by 

-l 



9(A) 



A* 



-2j\t 2 



1 - 4-cot(AT) ) 
jA / 



Note that as A — > 0, A — ► A, and one can see that in the 
limit of AT « 1 there is no discrete spectrum. Furthermore, 
the continuous spectrum tends to 

q (X) = -A*Te- jAT 'sinc(2T/), A = 2tt/, 

which is just the ordinary Fourier transform of the q(t). 

Fig. [4] shows the two spectra for T = 1 and various values 
of A. For small A, there is no discrete spectrum and the 
continuous spectrum is essentially just the ordinary Fourier 
transform of q(t). As A is increased, the continuous spectrum 
deviates from the ordinary Fourier transform and one or more 
discrete mass points appear on the ju> axis. 

D. Elementary Properties of the Nonlinear Fourier Transform 

Let q(t) <-» (q(X),q(Xk)) be a nonlinear Fourier transform 
pair. The following properties are proved in Appendix iBl 

1 ) (The ordinary Fourier transform as limit of the nonlinear 
Fourier transform): If 1 1 c?| | i, x « 1, there is no discrete 
spectrum and q(X) — > Q(X), where <5(A) is the ordinary 
(linear) Fourier transform of —q*(t) 



Q(A) 



00 

J q*(t)e~ 2jXt dt. 



2) (Weak nonlinearity): If \a\ « 1, then aq(X) «b aq(X) and 
aq(Xk) * aq(Xk)- In general, however, aq(X) ^ aq(X) 
and aq(X k ) / aq(X k ). 

3) (Constant phase change): e^q(t)(X) = e-^g(t)(A) and 
e^q(t)(X k ) = e^q(t)(Xky ^ _ 

4) (Time dilation): = |o|g(oA) and g(£) = |o|g(aA fc ); 

5) (Time shift): q(t - to) «- e~ 2 ^*o (g(A, )g(A fc )); 

6) (Frequency shift): q(t)e~ 2jut «-» (g(A - w), g(A fc - w)); 

7) (Lax convolution): If <?2(i) = Qi(t) * (L,M;C), then 
<f 2 (A) = ff(A,£)<fi(A) and <f 2 (A fc ) = H(X,C)qx(X k ). 
For the NLS equation, the channel filter is H(X, C) = 
cxp(-4jA 2 £). 

'30 

8) (Parseval identity): j \\q(t)\\ 2 dt = E + E, where 



N 



E=- f log(l+|g(A)| 2 )dA, B = 4|]S>(A J 0. 

-oo 

The quantities £ and £ represent the energy contained 
in the continuous and discrete spectra, respectively. 

In addition, we have the following properties related to the 
nonlinear Fourier transform 

1) (Causality and layer-peeling property): Let qi(t) and 
q 2 (t) be two signals with non-overlapping support, e.g., 
signals in a pulse train. Without loss of generality, 
assume that q\(t) is supported on t ^ to, and that 
q 2 {i) is supported on t > to- If (ai,6i) and (02,62) 
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-l -0.5 0.5 l 
R(A) 




(a) (b) 

Fig. 4. Discrete and continuous spectra of the square wave signal with T — 1 and (a) A = 1 (b) A = 2 (c) A = 6. 



(C) 



-1 -0.5 0.5 1 



(o(A),6(A)) = MA)A(A)) o (a 2 (A),6 2 (A)) 



are, respectively, the nonlinear Fourier coefficients of eigenvectors at z = C However, by proper scaling, one can 
qi(t) and <? 2 (t), then the nonlinear Fourier coefficients obtain the canonical eigenvectors at any z. 
of qi(t) + <? 2 (£) are given by For instance, focusing on v 2 (t, A; z), and changing variables 

to u 2 (t, A; z) = v 2 (t, A; z)e~ 2 ^ x z , we obtain u 2 = Pw 2 with 
boundary condition (l25l l for all z. Consequently, u 2 (i, A; z) is 
(ai(A)a 2 (A) - bi(A)6|(A*),ai(A)b 2 (A) + &i(A)a* (A*)i) canonical eigenvector for all z. By transforming (O, the 

evolution equation for u is asymptotically (at t = +co) given 

by 

,0 -4jA 2 z y 



That is to say, if we slice the signal in time in consec- 
utive portions according to a mesh —go < • • • < t-i < 
to < ti < ■ ■ ■ < oo, the nonlinear Fourier coefficients 
satisfy the Markov property 



u 2 (oo,A;z) 



2 i u (oo,A;z), 



{a k (\),b k (\);q(t),tk < t < t k+1 ) (a k +i(X), 6fc+i(A)) , 



which gives 



where (ajt(A), 6fe(A)) are the spectral coefficients calcu- 
lated from q(t) in —go < t < t k - 
2) (Purely imaginary eigenvalues) If II^Hl! < f, then there 
is no discrete spectrum. If q(t) is real, nonnegative, 
piecewise smooth, and single-lobe (non-decreasing for 
t < to and non-increasing for t > to), then there are 



precisely N=[% + 
imaginary and simple 1251I . 



ej eigenvalues, all purely 



V. Evolution of the Nonlinear Fourier Transform 

Derivation of the evolution of the nonlinear Fourier trans- 
form of a signal propagating based on the NLS equation pro- 
ceeds straightforwardly. As q(t, z) propagates, the eigenvalues 
of L are preserved and the eigenvectors of L propagate based 
on d22b . Assuming that q(t, z) and its time-derivative vanish at 
t = +CO for all z C during the propagation, then as t — » oo 
d22l is reduced to 

v >& g )^ { -2j\ 2 

Thus the boundary conditions (|24| | and ( |25T l are transformed 
to 



(35) 



v\t,X) 
v 2 (t,X) 



e jXt e 



-jxt e 2jyz 



t 



t 



+00, 



—00. 



These transformed boundary conditions are not consistent with 
the boundary conditions d24l i and d25l l used to define the 
canonical eigenvectors, due to the additional factors e- 2: > x z . 
As a result, the evolution of the canonical eigenvectors from 
z = 0, according to v z = Mv, does not lead to the canonical 



i 2 (oo, A; z) = w 2 (co, A; 0) 



w 2 (oo, A; z) = m 2 (go, A; 0)e~ ijX 2 
Using expressions (l3TT i and ( f32l > we obtain 

a{X,z) = \imu 2 (t,X;z)e jXt = Kmu|(i, A; 0)e jA * = a(A,0) 
b(X,z)= lim u 2 M,X;z)e- 3Xt = lim ul(t, X;0) e - 4jx2z e- 3Xt 

t—>ao t— >oo 

= fo(A,0)e- 4jA2z . 

In turn, the nonlinear Fourier transform propagates according 
to 



qi^z){X) 
q(t,z)(Xj) 



-4j\ 2 z 



?M)(A), 
e-^ x >q(t,0)(X 3 ), 
Xj(0), .7 = 1,2, 



..,N. 



(36) 



Note that, since a(A, z) is preserved under the evolution (i.e., 
is independent of z), the number of the discrete eigenvalues — 
which are zeros of a(X) — are also preserved. 

In summary, we see that the operation of the Lax convolu- 
tion in the nonlinear Fourier domain is described by a simple 
multiplicative (diagonal) operator, much in the same way that 
the ordinary Fourier transform maps y(t) = x(t) * h(t) 
to Y{u) = X(uj) ■ H(u). The channel filter in (06} is 
exp(— AjX 2 z). 

VI. An Approach to Communication over 
Integrable Channels 

Since the nonlinear Fourier transform of a signal is essen- 
tially preserved under Lax convolution, one can immediately 
conceive of a nonlinear analogue of orthogonal frequency divi- 
sion multiplexing (OFDM) for communication over integrable 
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channels. We refer to this scheme as nonlinear frequency 
division multiplexing (NFDM). In this scheme, the input 
output channel model is given by 

Y(X) = H(X)X{\) + Z(X) 
f(A,) = H(X 3 )X(X 3 ) + Z(Xj), 

where 1(A) = q(X, 0) and X(Xj) = q(X, 0) are spectra at the 
input of the channel, Y(X) = q(X, z) and Y(X 3 ) = q(X,z) 
are spectra at the output of the channel, Z(X) and Z(Xj) are 
effective noises in the spectral domain, and the channel filter 
is 

H(X) = e7^\ 

A bandlimited Gaussian noise in time domain generally maps 
to non-Gaussian noise processes in the spectral domain. 
The proposed scheme consists of two steps. 

• The inverse nonlinear Fourier transform at the transmit- 
ter (INFT). At the transmitter, information is encoded in 
the nonlinear spectra of the signal according to a suitable 
constellation on (l(A), l(Aj)J . The time domain signal 
is generated by taking the inverse nonlinear Fourier 
transform, 

q(t) = INFT (1(A), 1(A)) , 

and is sent over the channel. (The INFT is described 
formally in the next section.) 

• The forward nonlinear Fourier transform at the receiver 
(NFT). At the receiver, the (forward) nonlinear Fourier 
transform of the signal, 

(y(A),F(A)) =NFT(g(f,*)) 

is taken and the resulting spectra are compared against 
the transmitted spectra according to some metric 
d (1(A), 1(A); Y(A),f(A)). 

As q(t, 0) propagates in the time domain based on the com- 
plicated nonlinear equation, it is significantly distorted and, 
among other distortions, undergoes intersymbol interference 
(ISI) in a very complicated manner. Despite this, the nonlinear 
spectra of the signal, (l(A), X(Xj)J, are essentially held in- 
variant during propagation, up to a complex-valued scale factor 
(see d36ll). All the nonlinear spectral components propagate 
independently of each other, and the channel is decomposed 
into a number of linear parallel independent channels of the 
form yk = CfcXfc + n^, where denotes the effective noise 
in the fcth channel. By diagonalizing the channel in this way, 
the ISI is removed in the spectral domain. 

In this scheme, as in linear OFDM, communication ob- 
jectives, such as constellation design, coding and modulation 
are entirely formulated in the spectral domain. All available 
degrees of freedom, i.e., {Xj, X(Xj), 1(A)} can be generally 
modulated. Time domain constraints can be translated to 
constraints in the spectral domain. A power constraint, for 
instance, can be exactly transformed to a power constraint in 
the spectral domain with the help of the Parseval Identity. A 
bandwidth limitation is not directly and simply expressed in 
the nonlinear spectral domain; however, it appears that the 
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Fig. 5. Nonlinear frequency division channel model 

nonlinear spectrum of a signal bandlimited to W is indeed 
confined, approximately, to a vertical strip in the real line and 
the upper half complex plane, with a width proportional to the 
W. 

Fig. |5] illustrates the NFDM channel model that we use in 
this paper (and its subsequent parts) for data transmission over 
integrable channels such as optical fibers. 

VII. Inverse Nonlinear Fourier transform 

In this section, we describe the inverse nonlinear Fourier 
transform, which is a method for recovering the signal q(t, •) 
from its nonlinear Fourier transform (q(X), q(X 3 )). This is the 
opposite of what was done in the Section IIV1 and it gives 
further important insight into the mechanism of the NFT. As 
in Section [TV] the value of z is irrelevant here and the index 
z is thus suppressed. 

A. Riemann-Hilbert Factorization 

In the 1950s, before the publication of Gardner lfl6l . 
a method for retrieving the potential q{t) in the linear 
Schrodinger equation from the knowledge of the scattering 
matrix S was already known in quantum mechanics. It was 
later realized that this method is an instance of a Riemann- 
Hilbert problem in complex analysis 0~Z), lfl3l . Alternatively, 
the same task can be accomplished using the Gelfand-Levitan 
method fill , a different approach developed earlier in the 
context of inverse problems for Sturm-Liouville differential 
equations. Although either of these methods can be used for 
solving the inverse problem, in this paper we will use the 
Riemann-Hilbert method. 

To begin, canonical eigenvectors can be found in two ways. 
On the one hand, they are related to q(t) through the Zakharov- 
Shabat system (fSTJ. One can solve (f2lT> to explicitly express 
the canonical eigenvectors as a series involving q(t). On the 
other hand, canonical eigenvectors are related to the nonlinear 
Fourier transform via the projection equations (f27b-(f28b. The 
latter are two equations for four unknowns and in general 
cannot be solved. However, the unique properties of the 
canonical eigenvectors, which are analytic in disjoint regions 
of the complex plane, will allow us to find them in terms of the 
nonlinear Fourier transform. Note that this second derivation 
does not explicitly depend on q(t); however, by equating the 
canonical eigenvectors obtained from these two derivations, 
we can relate the signal q(t) to its nonlinear Fourier transform 
(?(A),g(A,)). 
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We find it helpful to briefly introduce the tool that we em- 
ploy in this section, namely the Riemann-Hilbert factorization 
problem in complex analysis. 

Definition 5 (Riemann-Hilbert factorization). The Riemann- 
Hilbert factorization problem consists of finding two complex 
functions f~(z) and f + (z), respectively, analytic inside and 
outside of a closed contour C in the complex plane, such that 
on C they satisfy the boundary condition ,f + (z) = g(z)f~(z) 
for some given g(z). 

It g(z) satisfies a Holder condition, it is possible to find 
both ,f + (z) and f~(z) everywhere in the complex plane from 
the given boundary condition. In Appendix [Cj we provide a 
brief overview of this problem and its solution. See l26l for 
more discussion. 



B. The Inverse Transform 

The inverse transform maps the two spectral functions 
(q(X),q(Xj)) to the signal q(t). 

As noted, we need to express the canonical eigenvectors 
in terms of the nonlinear Fourier transform. Below we will 
find it more convenient to work with canonical eigenvectors 
subject to fixed boundary conditions. Scaling the canonical 
eigenvectors as V 2 = v 2 e >xt , V 2 = z) 2 e — jA *, V 1 = v 1 e~i xt 
and V 1 = w 1 e jAt , the projection equations d27b and (f28b are 
transformed to 



V 2 (t, A) = a(A)F 1 (t, A) + b{X)e 2jXt V\ (37) 
Vi(t,\*) = -o*(A*)V 1 (t,A) +b*{\*)e- 2jXt V 1 . (38) 



Lemma 5. If q s then V 1 ^, A) and V 2 (t,X) are 

analytic functions of A in the upper half complex plane 
C+ = {z e C : > 0} while V^-faX) and V 2 (t,X) 

are analytic functions of A in the lower half complex plane 
C~ = {ze C : 9(z) < 0}. 



Proof: See Appendix ID1 
Rearranging (l37T>-(f38T> gives 



□ 



(y 1 v 2 ) = (v 1 




a*(A*) e 



system, is obtained: 



A -A, 

1 7 q(0e 2 ^V\t,C) 



I 



27rj J C - (A - je) 



-dC, 



V 



1 M) = n-s 



TV .. 



q*(X*)e-^V^t,Xi) 



A - A? 



i r j»(c t )c-wv 1 (t,0 



f 27rj J * Y-(A + je) '" dC 



TV - 



+ 



«(A i )e« A **V 1 (f > A ( ) 



A* — A, 



+ 



1 f gCOe^^^C) 



27Tj J 



C - A* 



dC, m=l,...,iV, 



V 



i (*.A m )= ; -s 



TV - 



^(A^e-W'^^A,) 



Am — Aj 



1 f g*(C)e- 2 ^yi(i,C) 



I 



C-A, 



m = 1,...,N, 



(40) 



This is a system of 2tV + 2 linear equations for 2A + 2 dis- 
crete and continuous canonical eigenvectors {V 1 ^, A m )}^ =1 , 
{V^.Am)}™^, and V x (i,A) and V^frA) as a function of 
the Fourier transform (g(A) , (?( A^)). 

Next, the canonical eigenvectors are related to the signal 
q(t). By inspecting the Zakharov-Shabat system d2Tl ). it is 
shown in Appendix [E] that for |A| » 1 



+ * J | 9 ( f )l*«<J + <^>- 



(41) 



a* (A*) 



a*(A*) 
a*(A*) C 



-2jAt 



Since (V^ 2 V^ 1 ) and \ V 2 V l j are analytic, respectively, 
in the upper and lower half-planes, d39t defines a matrix 
Riemann-Hilbert problem. 

Solution of the Riemann-Hilbert factorization problem (l39l 
is given in Appendix [0 Omitting the details, the following 
linear system of equations, known as the Riemann-Hilbert 



It now becomes easy to recover the signal q(t) from the 
nonlinear Fourier transform (q(X),q(Xj)), Eigenvector V 1 is 
related to q via (|4"TT i for |A| » 1, and is related to the 
nonlinear Fourier transform via the second equation in ( f40b . 
Approximating 1/(A - C) and 1/(A - A 4 ) by -1/A + 0(A" 2 ) 
in the second equation of d40b for |A| » 1 and comparing 
with Vi in ( Hll . we obtain 
(39) 

TV 

q*(t) = 2jY i q^i)e 2jXit V 2 \t,X i ) 

i=l 

- - P q(X)e 2jXt V 2 \t,X)dX. (42) 

^ J-00 

This represents q(t) as a function of the nonlinear Fourier 
transform. 

In summary, given (q(X), q(Xj)), we first solve the 
Riemann-Hilbert system to find the eigenvector V 1 . For this 
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purpose, one can discretize the system d40l > and solve a linear 
system of equations of the form 

Ax = b, 

for appropriate A and b. This is done for each fixed ti to 
determine the samples of V l (ti, X) at that time. Then, V 1 , q 
and q are substituted in ( l42l to obtain the signal q(t). 

Note that this inverse transform is taken only once at the 
transmitter to synthesize the desired pulse shapes. It is only 
the forward transform which needs to be taken in real time at 
the receiver. 

The mathematical tools developed in this paper are used in 
Parts II, III and IV of this paper to present further details 
pertaining the application of the suggested scheme in data 
transmission over integrable channels. 

In Part II, numerical methods are suggested to compute the 
nonlinear Fourier transform of a signal with respect to the 
AKNS Lax operator. We use the results of Part I and II in 
Part III and IV to study the NFT in the presence of the noise 
and illustrate the performance of the method in actual fiber- 
optic systems. 

VIII. Conclusions 

The nonlinear Fourier transform of a signal with respect 
to an operator I in a Lax pair consists of continuous and 
discrete spectral functions x(X) and x(Xj), obtainable by 
solving the eigenproblem for the L operator. The NFT maps 
a Lax convolution to a multiplication operator in the spectral 
domain. Using the nonlinear Fourier transform, we propose a 
transmission scheme for integrable channels, termed nonlinear 
frequency-division multiplexing, in which the information in 
encoded in the nonlinear spectrum of the signal. The scheme is 
an extension of traditional OFDM to any channel generated by 
a Lax pair. An example is the optical fiber channel, in which 
signals propagate according to the nonlinear Schrodinger equa- 
tion. The class of integrable channels, though nonlinear and 
often complicated, are somehow "linear in disguise," and thus 
admit the proposed nonlinear frequency-division multiplexing 
transmission scheme. 

Appendix A 
Spectrum of Bounded Linear Operators 

When moving from finite-dimensional spaces (of e.g., ma- 
trices) to infinite-dimensional spaces (of e.g., functions and 
operators), some results do not carry over necessarily. Here 
we recall a few useful results in functional analysis ||20l . 

Let TL be a Hilbert space, let V be a dense subset of TL, 
and let L : T> —* TL be an operator. 

Definition 6. The adjoint of L is the operator L* whose 
domain T>* consists of all ijj e TL for which there exists an 
element L*ip e TL such that 

The operator L is said to be self-adjoint if T> = T>* and 
L* = L. Ignoring domain subtleties (as is the case for bounded 
operators), self-adjoint operators are the analogue of Hermitian 
matrices. 



Definition 7. Given an operator L on a Hilbert space, an 
operator M is said to be the inverse of L if the domain of 
M is the range of L, the range of M is the domain of L, and 
ML = I and LM = I. 

We will restrict ourselves now to operators whose T> is the 
entire space TL. 

An operator L : TL — > TL is invertible if it 1) is one-to-one 2) 
is onto 3) has bounded inverse. In finite-dimensional spaces, 
only the first condition is required. 

An operator is bounded if it maps bounded inputs to 
bounded outputs. A bounded operator is invertible if it is one- 
to-one and onto. 

Definition 8. The spectrum of an operator L on TL is defined 

as 

a(L) = {A e C\L - XI is not invertible }. 

The spectrum of a bounded operator can be partitioned into 
two classes, depending on the reason that L — XI fails to be 
invertible. 

A complex number A is considered part of the discrete 
spectrum if L — XI is not one-to-one, i.e., Lip = Xip has a non- 
zero solution ip e H. In this case, A is called an eigenvalue, 
and each tp satisfying this equation for the given A is an 
associated eigenvector. The set of all eigenvalues is called the 
point spectrum or discrete spectrum of L, CTdisc(^)- 

It can also happen that L—XI fails to be surjective, i.e., 
the range of L — XI is a proper subset of TL. We call the set 
of A for which this happens the continuous spectrum cr cont (X) 

EDI. 

In finite-dimensional Hilbert spaces the spectrum is entirely 
discrete. This may no longer be true in infinite-dimensional 
spaces, where the eigenvalues (if they exist) may only be one 
part of the spectrum. The spectrum of a self-adjoint operator 
is real. In general, a (IS) = a(L*). 

The following examples illustrate some these possibilities. 

Example 5. The operator L(x(t)) = tx(i), x(t) e L 2 [0, 1], has 
no eigenvalues and its spectrum is purely continuous a(L) = 
[0,1]. □ 

Example 6. The Fourier transform operator F(q)(u) = 

'30 

$ q(i)e :,wt dt, regarded as on operator on L 2 (M.) has the 

— CO 

property F A = I. The eigenvalues are therefore the discrete 
values {+l,+j}. If p(t) is an arbitrary polynomial, then 
p(t) exp(— 1 2 /2) is an eigenfunction. □ 

Example 7. Let V 2 be the Laplace operator and let r denote 
the radial distance in the three-dimensional space. The operator 
L = — V 2 — - is self-adjoint, and therefore it has a real 
spectrum. The continuous spectrum is [0,oo) and the discrete 
spectrum is given by A„ = — 4^7, n = 1,2, □ 

Example 8. It is possible that the discrete spectrum of 
an operator is uncountable. For example, for a sequence 
(x ,x 1 ,...) e £ 2 = {x : Xi 4 \ x i\ 2 < °°}> let the left-shift 
operator L be defined as L (xq, x\, X2, • • •) = (xi, £2, • • •)■ 
The spectrum consists of the unit disk |A| 1. The portion 
|A| < 1 is the discrete spectrum while |A| = 1 is the contin- 
uous spectrum. The adjoint of L is the right-shift operator 
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R (xq, xi, X2, ■ ■ ■) = (0, xq, xi, . . .). This operator has the 
same total spectrum, but it is entirely continuous. □ 

Example 9. Consider the Zakharov-Shabat system (f2TT >. in the 
form (DI - P)v = 0, v e L 2 (R). One can see that for AeK, 
the eigenvectors v(t, A) contain oscillatory terms exp(+jAi), 
and thus are not in L 2 (R) (unbounded in norm). As a result, 
the operator (DI — P)^ 1 is unbounded and the inverse (DI — 
P)^ 1 does not exist on the Hilbert space L 2 (R). Thus A e 
K forms the continuous spectrum. This is generally the case 
where for an operator L on H for which the eigenproblem 
Lv = Xv has a solution with infinite norm. □ 

Note that some authors subdivide what we refer to as the 
continuous spectrum into further classes (e.g., residual spec- 
trum, essential spectrum, etc.) |20l ; however, for the purposes 
of this paper, classification into discrete and continuous spectra 
will suffice. 

Hermitian matrices are always diagonalizable, have real 
eigenvalues, and possess a complete set of orthonormal eigen- 
vectors, which provide a basis for the input space. There is 
a perfect generalization of this result to compact self-adjoint 
operators |20l . 

Theorem 6 (Hilbert-Schmidt spectral theorem). Let L be a 

compact self-adjoint operator in %. Then it is always possible 
to find eigenfunctions {ip n } of L forming an orthonormal basis 
for H. 

The spectral theory of operators that are not compact and 
self-adjoint is more involved; see l20l . 

An important class of operators are the multiplication op- 
erators, which are analogous to diagonal matrices. 

Definition 9 (Multiplication operator). Let f(t) be an arbitrary 
function. The operator L defined by (Lij})(t) = f(t)ip(t), 
which performs sample-wise multiplication, is called the mul- 
tiplication operator or diagonal operator induced by f(i). 

Theorem 7. Every bounded self-adjoint operator in a separa- 
ble Hilbert space H is unitarily equivalent to a multiplication 
operator T, i.e., L = UTU^ 1 where U is unitary. 

It follows that the essence of a bounded self-adjoint operator 
is just a multiplication operator. 

Appendix B 
Proof of Elementary Properties of the NFT 

In this section, we sketch the proofs of the properties of the 
NFT stated in Section ITV^Dl 

1) If ||g(£)||Li « 1, then y 2 (t,X) and q 2 (i) terms can be 
ignored in ( f33l > and ( f34b . From the resulting equations, 
it follows that there is no discrete spectrum and q(X) —> 
Q(\). The quadratic terms are introduced by the NFT 
to account for the nonlinearity. 

2) This follows from 1) in above, and that when \a\ « 1, 
the squared terms representing the nonlinearity can be 
ignored. 

3) This follows from replacing q(t) with e^q(t) in (|33l 
and d34l 



4) Scaling the Zakharov-Shabat system (fJTJ as t' = t/a, 
we obtain the desired result. Note that if sgn(a) < 0, 
boundary conditions at +co are interchanged. 

5) Property 5) and 6) follow by replacing q(t) with q(t—to) 
and e 2jwt g(t) in d33l and d341 l. and accordingly changing 
variables. 

6) This is the statement of ( f36l >. 

7) The following identity, known as the trace formula, can 
be easily proved for the nonlinear Fourier transform lfl2l 



N 



Here c„ are the secondary constants of motion, i.e., 
quantities, directly in terms of the time domain data, 
which are preserved during the flow of the NLS equa- 
tion. The first few ones are the energy 



Cl 



UU 



dt, 



momentum 



UU 

C2 = h \ q{t)qt{t)dt > 



and the Hamiltonian 



C3 



00 

J (\ q (t)f-\q t (t)\ 2 )dt. 



Parseval's Identity is the trace formula evaluated at n = 
1. 

8) Let q(t) be supported in the interval [£1,^2] and let 
K(q(t), t\, £2) denote a propagator (linear transforma- 
tion) which maps v(t,X) in OTT i from v(ti,X) to 
v(t2, A). The propagator is structured as 



K 



a{X)e^- 1 ^ -b*(X*)e^ x ^ +t ^ 
6(A)e- jA ( t2+tl > a*(A*)e-^ A C t2 - tl ) 



Let qi(t) and q2(t), supported, respectively, in the inter- 
vals [£1,^2] and [£2,^3], ti < t-2 < £3, correspond to the 
propagators Ki(qi(t),ti,t 2 ) and K 2 {q2{t) 1 t 2 ,h). Then 
q(t) = qi(t) + q 2 (t) is supported in [£1 , £3], and from 
linearity corresponds to the propagator K = K2K1 



K 



a 2 {Xy x ^- t ^ -&*(A*)eJ' A ( t3 + t2 ) 

b 2 {X)e-^ +t ^ a*(A*)e-J A (* 3 -* 2 ) 

ai(A)eJ' A ( t2 - tl ) -6*(A*)eJ' A ( t2+tl ) 

6i(A)e^' A ( t2+tl ) al{X*)e- jX ^-^ 

The lxl and 2x1 elements are 



Kll = e^~^ MA)a 2 (A) - h(X)bt(X*)) , 
K 21 = e-^ t3+tl > (oi(A)6a(A) + 6i(A)a|(A*)) . 

Comparing with K X \ = a(A)e jA(t3 " tl) and K 2 \ 
&(A)e~- jA ( t3+tl ^, we get the desired result. 
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Appendix C 

RlEMANN-HlLBERT FACTORIZATION PROBLEM 

A. Preliminary 

Recall that a complex function f(z) = u(x,y) + jv(x,y) 
in the complex plane z = (x, y) is differentiable at a point 
(xo,yo) if and only if the partial derivatives u x , u y , v x and 
v y are continuous and 

u x = v y , u y = —v x . (43) 

The compatibility conditions ( |43l are called the Cauchy- 
Riemann conditions and are obtained by equating the limit 
Sz —* along the real and imaginary axes. This means that, 
unlike real functions, differentiability of a complex function 
imposes a constraint between the real and imaginary parts of 
the function. 

A function f(z) is said to be analytic at (xo,yo) if it 
is differentiable in a neighborhood of that point. If f(z) is 
analytic in an open region il of the complex plane, then 
a power series representation of f(z) is convergent in f2. 
Existence of a power series representation has a number of 
interesting consequences. For instance, the zeros of a non- 
zero function analytic in an open region of C are isolated 
points in l24l . 

Lemma 8 (Plemelj formulae). Let C be any smooth, closed, 
counter-clockwise, contour in the complex plane and let f(x) 
be any function satisfying a Holder condition on C defined by 

\f(t)-f(r)\^k\t-r\ a ,k>0, Vt.reC, 

for some < a ^ 1. Then the projection integral 



c 



(44) 



is analytic everywhere in C except possibly at points £ on the 
contour C (where F(C,) is not defined). If C, approaches C 
along a path entirely inside the contour C, then 



f(0 , 1 ... f /(*) 



dz. (45) 



If £ approaches C along a path entirely outside the contour 
C, then 

F~(0 ± lim F(C) = -ttf- + -L p.v. f M-Az. (46) 

z^C 2 27TJ J z - C 

c 

Here p.v. denotes the principal value integral defined by 



z - C J z - c 



p.v. I 

c c-c e 



dz, 



in which C e is an infinitesimal part of C centered at z = C, 
and with length 2e. 



Proof: See (261. ■ 
The projected function F(Q is a sectionally analytic func- 
tion of ( with respect to C, i.e., it is analytic in sections C + 
(the interior of C) and C~ (the exterior of C), and the limits 
F ± (Q exist (as given by (05]) and d46b). 



A consequence of Lemma [8] is that F-(£) satisfies the 
following jump condition on the boundary C 

F+(t)-F~(t) = f(t). 

The projection operator therefore produces functions which 
are analytic almost everywhere, except on a contour where it 
experiences a jump in its limits. 

1) The scalar Riemann-Hilbert problem: In the scalar 
Riemann-Hilbert problem, the task is to find functions f + (z) 
and f~(z), analytic, respectively, inside and outside a given 
smooth closed contour C, such that on C 



f + (t)=g(t)f-(t) + h(t) teC, 



(47) 



where h(t) and g(t) (with g(t) for all t e C), are given 
functions satisfying a Holder condition on C. 

Both unknowns f + (z) and f~(z) can be determined from 
the single boundary equation (|47| |. using the projection op- 
erator (144-b . To see this, first consider the homogeneous case 
where h(t) = 0. One can rewrite (ffTT i as a jump condition 

log/ + (z) - log/~(z) = logff(z) zeC. 

Functions log/ + (z) and log/~(z) can be viewed as portions 
of a single sectionally analytic function log/(z) which is 
analytic in C + and and on boundary C its limits jump as 
\og g(t). In view of the projection operator P (l44l . consider 

log 5(A) 



log/(z) 



27TJ 



A — 



-dA. 



If log g(t) satisfies a Holder condition on C, then log/(z) is 
analytic strictly inside and outside C. On C, we can define 
log/ + (z) and log/ _ (z), respectively, as equal to the limits 
(|45| > and (|46| >. The function obtained in this way satisfies (|47] > 
and has the desired analyticity properties. 

Note however that, unlike g(t), log g(t) in the integrand 
may not satisfy a Holder condition. To resolve this issue, 
we can multiply g(t) by a decaying factor t~ k , for a 
suitable k, to make t~ k g(t) Holder, and obtain = 
(t~ k g(t)) (t k f-(z)). Therefore, defining 

^) = exp^-|^ x -fidA 

we have the following solution for the homogeneous RH 
problem: 



F{z) zeC^ 
F+(z) zeC, 



and 



f - (z)= U- k F(z) zeCT, 
1 { ' \z- k F-(z) zeC. 

Here k can be chosen so that t~ k g(t) is continuous and the 
total phase change of \ogt~ k g(t) is zero along the closed path 
C. 

The solution f-(z) is called the fundamental solution to 
the scalar RH problem. From the homogeneity of d47l i. one 
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can obtain other solutions by multiplying f-(z) by any entire 
function in C. 

We can generalize the above procedure to solve the non- 
homogeneous Riemann-Hilbert problem ( |47T >. In this case, we 



can find a factorization g(t) 



9 + (t) 



by solving a homo- 



geneous Riemann-Hilbert problem with boundary conditions 
g+(t) = g(t)g-(t). Then <@7|> is reduced to 

/+(*) f-(t) h(t) 



g+(t) g-{t) g+(ty 

which, as before, can be solved in closed form using the 
Plemelj formulae. 

2) The matrix Riemann-Hilbert problem: When formu- 
lating the inverse nonlinear Fourier transform, we face a 
matrix Riemann-Hilbert problem d39b . Matrix RH problems 
are generally more involved and may not allow closed-form 
solutions (26). As we will see in the Appendix [F] for the 
particular matrix RH problem d39l l, the projection operator ( l44l 
is sufficient to solve the problem. 

Appendix D 
Proof of Lemma|5] 

Analyticity of the canonical eigenvectors is directly a 
property of the Zakharov-Shabat system ( I2TI 1. Let us scale 
canonical eigenvectors as V 2 = u 2 e-? , V 2 = v 2 e~^ t ,V 1 = 
y i e -j\t an( j yi _ ~i e jAt Xransforming the Zakharov-Shabat 
system (I2TI 1. the scaled eigenvectors satisfy 



Yr 



V} 



q 
-q* 2jX 

-2jX q 
-q* 

q 
-q* 2jX 

-2jX q" 
-q* 0, 



y. 


V 2 (-m) -- 


1 v\ 


V 2 (-oo) = 


)+. 


V^oo) = 




VHco) = ( 



or 



(48) 





or 



Let us, for instance, solve for the canonical eigenvector V 2 
in (|48]l. Considering the q terms as an external force and using 
the Duhamel formula 1271 . (|48| | can be transformed into its 
integral representation 



V 2 (t,X) 



+ 



J h(t-t',X) 



9i P)V 2 (t',X)dt', (49) 



-q*(t') 



where the system impulse response h(t, A) is 



h(t,X) 



u(t) 








2jXt 



u{t) ' 



(50) 



and we have ignored transient terms since the boundary 
condition starts at t = —00. 

The analyticity of eigenvectors can be seen intuitively at 
at this stage. The impulse response (T50b involves the term 
e 2jA *u(i) and hence d48l > is well defined in C + if q e 
L 1 (R). The impulse response for the V 1 equation involves 
— e _2j u{— t) and hence it is bounded in the same region. 



The impulse response for V 2 and V 1 have terms proportional 
to e~ 2 -' At w(i) and — e 2 -' At w(— t), respectively, and therefore 
these eigenvectors are analytic in C~ for q e L 1 (R). 

A more precise argument proceeds by solving (l49l explic- 
itly. Duhamel's integral (l49l is of the form of a fixed-point 
map 

V 2 = (T) +T(V 2 ), 





where T is the linear operator underlying (l49l . A candidate 
for the solution is the sum 



V 2 = £ U k 



(51) 



fc=0 



where U k satisfy the iteration 



U k+1 = T(U k ) +(h, U 



The first few terms are 



U 1 



, u 2 



I I g (i 2 ) g *(ii)e 2jA( * 2 - tl) diidi 2 

to = — CO ti = — 00 





, (52) 



(53) 



and the k term is recursively defined by 



uM = (o) + I 



q(t'W(t',X) . 
-q*(lf)U^(t',z)e 2 ^ t - t \l at - (M) 



By induction on k, as (l54l suggests, if U k is analytic and 
q{t) e L 1 (M), then U k+1 is analytic. In addition, the series 
(|5TT i is uniformly convergent on t and thus V 2 is analytic in 

C+. 

Similarly one proves the analyticity of the other canonical 
eigenvectors in their corresponding region. 

Appendix E 

ASYMPTOTICS OF CANONICAL EIGENVECTORS AND 

Nonlinear Fourier Coefficients when |A| » 1 

If A e C + and |A| » 1, then -j-^ = ^ + 0(A- 2 ) 
and, taking the inverse Fourier transform, we can approximate 
e 2 i xt u(t) = —^5(t) + 0(A- 2 ). Substituting into (ED, 63. 
), for |A| » 1 we obtain 



+ 0{X- Z ). (55) 



A similar asymptotic expression can be derived for V 1 (A » 1) 
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If | A | — > oo, g can be assumed zero in (f2Tb compared to jX. 
Thus v(t, A) approaches the boundary conditions at t = +00. 
Therefore 

o(A) = (v 2 (t,X),v 1 (t,X)) s = (v 2 (+w,X),v 1 (+cx,,X)) s 
— ► (u 2 (-cx), A), u 1 (+co, A)> s = 1. (56) 

Similarly, it is shown that 6(A) — » as |A| — » 00. 

Appendix F 
Solution of the Riemann-Hilbert problem 



In Section IVHI the inverse nonlinear Fourier transform was 
formulated as an instance of the Riemann-Hilbert factorization 
problem. Using hints given in the Appendix [C] the resulting 
factorization problem can be solved in a simplified manner via 
an appropriate contour integration. 

Dividing both sides of the projection equations (f27b-(f28T> by 
<x(A)(A— Q, for parameter £ e C + , and integrating on the real 
axis — 00 < A < 00, we obtain 
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2^7 J 
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2^7 J 
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7(A)e^ At 1/ 1 (^ A) 
A-C 



dA, (57) 



in which the integration is performed on the open path z = 
A + j^{C)'~ 00 < A < 00. Without loss of generality, we can 
assume that ( is real, as otherwise, we can shift the real line 
to the place of £, by replacing f(z) with f(z — j'3f(C)) and 
integrating from — oo + j'Ss^C) to + G0 + J^tO- The integration 
path is assumed to pass the singularity A = £ from above in 
all the integrals. 

Cauchy integrals in ( T5Tb are computed from the residue 
theorem. The integration path — 00 < A < 00 can be closed in 
the upper or lower half-planes. To compute the first integral, 
we close the path in the upper half-plane and denote the 
resulting closed contour by C 1 ^, i.e., the upper half plane 
and excluding the singularity z = £ 
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(58) 



where, in the second line, when R — » 00, we have used the 
asymptotic values (T55t and (T56b in Appendix [E] Note that we 



assumed that eigenvalues Xj are all simple zeros of a(A), i.e., 
no multiplicity. 

To compute the second integral in (|57| |. we close the inte- 
gration path in the lower half-plane and denote the resulting 
closed contour by C^, i.e., the lower half-plane and including 
the singularity z = £ 
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A= — 00 
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J. i Y^Adz- lim 
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2wj I 
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(59) 



The last integral in ( TSTb is left uncomputed, because the 
boundedness of e 2jAt depends on the sign of t. For t > 0, we 
can consider C^- which leads to the expression ( l58l multiplied 
by u(t). For t < 0, we should inevitably consider C^, where 
poles of V 1 (x, X) are unknown. As a result, this integral is 
left untreated. 

Using (l58l and (|59l in d57l l, we obtain an integral equation 
relating canonical eigenvectors V 1 and V 1 to q(X) and q(Aj) 



N - 
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+ 



2^7 I 



g(Aj)e^W^ A,) 
k - X t 

g(A)e 2 J' At U 1 (i I A) 



A=-oo 



A-C 



dA. 



(60) 



This is the first equation in the Riemann-Hilbert system d40b - 
Since d60l ) holds for any ( e C, evaluating d60l l at ( = Xj, 
j = 1, . . . , N, produces the third equation in d40b . A similar 
integral equation for V 2 and V 2 gives the remaining equations. 
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